N-terminal Proteomics Assisted Profiling of the Unexplored Translation Initiation Landscape in Arabidopsis thaliana

Authors

  • Patrick Willems
  • Elvis Ndah
  • Veronique Jonckheere
  • Simon Stael
  • Adriaan Sticker
  • Lennart Martens
  • Frank Van Breusegem
  • Kris Gevaert
  • Petra Van Damme
Abstract

Proteogenomics is an emerging research field that still lacks a uniform method of analysis. Proteogenomic studies that combine N-terminal proteomics and ribosome profiling suggest that a high number of protein start sites are currently missing in genome annotations. We constructed a proteogenomic pipeline specific for the analysis of N-terminal proteomics data, with the aim of discovering novel translational start sites outside annotated protein-coding regions. In summary, unidentified MS/MS spectra were matched to a specific N-terminal peptide library encompassing protein N termini encoded in the Arabidopsis thaliana genome. After stringent false discovery rate filtering, 117 protein N termini compliant with N-terminal methionine excision specificity and indicative of translation initiation were found. These include N-terminal protein extensions and translation from transposable elements and pseudogenes. Gene prediction provided supporting protein-coding models for approximately half of the protein N termini. Besides the prediction of functional domains (partially) contained within the newly predicted ORFs, further supporting evidence of translation was found in the recently released Araport11 genome re-annotation of Arabidopsis and in computational translations of sequences stored in public repositories. Most interestingly, complementary evidence by ribosome profiling was found for 23 protein N termini. Finally, an in silico analysis of protein N-terminal peptides demonstrates the applicability of our N-terminal proteogenomics strategy in revealing protein-coding potential in species with well- and poorly-annotated genomes.
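The abstract mentions filtering for N termini "compliant with N-terminal methionine excision specificity" without spelling out the rule. The following is a minimal sketch of such a check, not the authors' pipeline. It assumes the commonly cited rule that the initiator methionine is removed when the second residue has a small side chain (Ala, Gly, Ser, Thr, Cys, Pro, Val) and is otherwise retained. The function names and example sequences are hypothetical.

```python
# Residues at position 2 that canonically permit removal of the initiator
# methionine. This rule set is an assumption for illustration; the exact
# criteria used in the paper are not given in the abstract.
NME_PERMISSIVE = frozenset("AGSTCPV")


def expected_n_terminus(orf_protein: str) -> int:
    """Return the 0-based index of the expected mature N-terminal residue."""
    if not orf_protein.startswith("M"):
        raise ValueError("ORF translation must start with methionine")
    if len(orf_protein) > 1 and orf_protein[1] in NME_PERMISSIVE:
        return 1  # initiator Met expected to be excised
    return 0      # initiator Met expected to be retained


def is_nme_compliant(orf_protein: str, peptide: str) -> bool:
    """Check that an identified peptide starts at the expected N terminus."""
    start = expected_n_terminus(orf_protein)
    return bool(peptide) and orf_protein.startswith(peptide, start)


if __name__ == "__main__":
    # Hypothetical candidate ORF translations and identified peptides.
    print(is_nme_compliant("MASKLVR", "ASKLVR"))  # Met removed before Ala
    print(is_nme_compliant("MKSLLVR", "MKSLLVR"))  # Met retained before Lys
```

A peptide that begins one residue downstream of a putative start codon only counts as supporting evidence here when the second residue permits excision. This is one way to separate candidate translation initiation sites from N termini generated by proteolytic processing.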

Similar resources

Identification and Expression Analysis of Two Arabidopsis LRR-Protein Encoding Genes Responsive to Some Abiotic Stresses

Two Arabidopsis thaliana genes, psr9.2 and psr9.4, appeared to be highly similar to a phosphate-starved induced gene, psr9, isolated from Brassica nigra suspension cells. Sequence analysis classified the encoded polypeptides as members of the leucine-rich repeat (LRR) protein superfamily. The sequence of psr9 proteins comprise a unique N-terminal region e...

Positional proteomics reveals differences in N-terminal proteoform stability.

To understand the impact of alternative translation initiation on a proteome, we performed a proteome-wide study on protein turnover using positional proteomics and ribosome profiling to distinguish between N-terminal proteoforms of individual genes. By combining pulsed SILAC with N-terminal COFRADIC, we monitored the stability of 1,941 human N-terminal proteoforms, including 147 N-terminal pro...

Target proteins of the cytosolic thioredoxins in Arabidopsis thaliana.

Possible target proteins of cytosolic thioredoxin in higher plants have been investigated in the cell lysate of dark-grown Arabidopsis thaliana whole tissues. We immobilized a mutant of cytosolic thioredoxin, in which an internal cysteine at the active site was substituted with serine, on CNBr activated resin, and used the resin for the thioredoxin-affinity chromatography. By using this resin, ...

Yeast Two Hybrid cDNA Screening of Arabidopsis thaliana for SETH4 Protein Interaction

The SETH4 coding sequence, 2013 bp in length, is a member of a gene family expressed in gametophytic tissues of Arabidopsis thaliana. This fragment was PCR amplified using Kod Hi Fi DNA polymerase and cloned into the pGBKT7 bait vector, and transformed E. coli DH5α cells containing the vector were selected on LB medium containing kanamycin. Finally, the pGBKT7-SETH4 bait was transformed into yeast ...

Arabidopsis thaliana proteomics: from proteome to genome.

Proteomics has become an important approach for investigating cellular processes and network functions. Significant improvements have been made during the last few years in technologies for high-throughput proteomics, both at the level of data analysis software and mass spectrometry hardware. As proteomics technologies advance and become more widely accessible, efforts of cataloguing and quanti...

Journal title:

Volume 16  Issue 

Pages  -

Publication date 2017